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CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This application claims the benefit of U.S. Provisional 

Application No. 60/122,926, filed December 18, 1998 and U.S. Provisional 
Application No. 60/075,941, filed February 25, 1998, the contents of which are 
incorporated herein by reference in their entirety. 



1 0 STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

REFERENCE TO MICROHCHE APPENDIX 
Not applicable. 

15 

FIELD OF THE INVENTION 

The present invention is directed to novel human and mouse DNA 
sequences encoding a protein which, when present in mutated form, results in the 
occurrence of Best's Macular Dystrophy. 

20 

BACKGROUND OF THE INVENTION 

Macular dystrophy is a term applied to a heterogeneous group of 
diseases that collectively are the cause of severe visual loss in a large number of 
people. A common characteristic of macular dystrophy is a progressive loss of central 

25 vision resulting from the degeneration of the pigmented epithelium underlying the 
retinal macula. In many forms of macular dystrophy, the end stage of the disease 
results in legal blindness. More than 20 types of macular dystrophy are known: e.g., 
age-related macular dystrophy, Stargardt's disease, atypical vitelliform macular 
dystrophy (VMDl), Usher Syndrome Type IB, autosomal dominant neovascular 

30 inflammatory vitreoretinopathy, familial exudative vitreoretinopathy, and Best's 

macular dystrophy (also known as hereditary macular dystrophy or Best's vitelliform 
macular dystrophy (VMD2)). For a review of the macular dystrophies, see Sullivan & 
Daiger, 1996, Mol. Med. Today 2:380-386. 
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Best's Macular Dystrophy (BMD) is an inherited autosomal dominant 
macular dystrophy of unknown biochemical cause. BMD has an age of onset that can 
range from childhood to after 40. Clinical symptoms include, at early stages, an 
abnormal accumulation of the yellowish material lipofuscin in the retinal pigmented 
5 epithelium (RPE) underlying the macula. This gives rise to a characteristic "egg 
yolk" appearance of the RPE and gradual loss of visual acuity. With increasing age, 
the RPE becomes more and more disorganized, as the lipofuscin accumulations 
disperse and scarring and neovascularization take place. These changes are 
accompanied by further loss of vision. 

10 The pathological features seen in BMD are in many ways similar to the 

features seen in age-related macular dystrophy, the leading cause of blindness in older 
patients in the developed world. Age-related macular dystrophy is an extraordinarily 
difficult disease to study genetically, since by the time patients are diagnosed, their 
parents are usually no longer living and their children are still asymptomatic. Thus, 

15 family studies which have led to the discovery of the genetic basis of many other 
diseases have not been practical for age-related macular dystrophy. As there are 
currently no widely effective treatrnents for age-related macular dystrophy, it is hoped 
that study of BMD, and in particular the discovery of the underlying genetic cause of 
BMD, will shed light on age-related macular dystrophy as well. 

20 Linkage analysis has established that the gene responsible for BMD 

resides in the pericentric region of chromosome 11, at 1 lql3, near the markers 
D11S956, FCERIB, and UGB (Forsman et al., 1992, Clin. Genet. 42:156-159; Hou et> 
al., 1996, Human Heredity 46:21 1-220). Recently, the gene responsible for BMD was 
localized to a -1.7 mB PAC contig lying mostly between the markers Dl 1S1765 and 

25 UGB (Cooper et al., 1997, Genomics 41:185-192). Recombination breakpoint 

mapping in a large Swedish pedigree limited the minimum genetic region containing 
the BMD gene to a 980 kb interval flanked by the microsatellite markers Dl 1S4076 
and UGB (Graff et al., 1997, Hum. Genet. 101: 263-279). 

One difficulty in diagnosing BMD is that carriers of the diseased gene 

30 for BMD may be asymptomatic in terms of visual acuity and morphological changes 
of the RPE observable in a routine ophthalmologic examination. There does exist a 
test, the electro-oculographic examination (EOG), which detects differences in 
electrical potential between the cornea and the retina, that can distinguish 
asymptomatic BMD patients from normal individuals. However, the EOG requires 
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specialized, expensive equipment, is difficult to administer, and requires that the 
patient be present at the site of the equipment when the test is performed. It would be 
valuable to have an alternative method of diagnosing asymptomatic carriers of 
mutations in the gene responsible for BMD that is simpler, less expensive, and does 
5 not require the presence of the patient while the test is being performed. For example, 
a diagnostic test that relies on a blood sample from a patient suspected of being an 
asymptomatic carrier of BMD would be ideal. 

SUMMARY OF THE INVENTION 
10 The present invention is directed to novel human and mouse DNA 

sequences that encode the gene CGICE, which, when mutated, is responsible for 
Best's macular dystrophy. The present invention includes genomic CGICE DNA as 
well as cDNA that encodes the CGICE protein. The human genomic CGICE DNA 
is substantially free from other nucleic acids and has the nucleotide sequence shown 
15 in SEQ.ID.NO.: 1. The human cDNA encoding CGICE protein is substantially free 
from other nucleic acids and has the nucleotide sequence shown in SEQ.ID.NO.:2 or 
SEQ.ID.NO.:4. The mouse cDNA encoding CGICE protein is substantially free from 
other nucleic acids and has the nucleotide sequence shown in SEQ.ID.NO.:28. Also 
provided is CGICE protein encoded by the novel DNA sequences. The human 
20 CG ICE protein is substantially free from other proteins and has the amino acid 

sequence shown in SEQ.ID.NO.:3 or SEQ.ID.NO.:5. The mouse CGICE protein is 
substantially free from other proteins and has the amino acid sequence shown in 
SEQ.ID.NO.:29. Methods of expressing CGICE protein in recombinant systems are 
provided. Also provided are diagnostic methods that detect carriers of mutant 
25 CGICE genes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A-F shows the genomic DNA sequence of human CGICE 
(SEQ.ID.NO.: 1). Underlined nucleotides in capitals represent exons. The start ATG 
30 codon in exon 2 and the stop TAA codon in exon 1 1 are shown in bold italics. The 
consensus polyadenylation signal AATAAA in exon 1 1 is shown in bold. The 
alternatively spliced part of exon 7 is shown in underlined italics. The exact lengths 
of two gaps between exons 1 and 2 and between exons 7 and 8 are unknown; these 
gaps are presented as runs of ten Ns for the sake of convenience. The portion of exon 
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1 1 beginning at position 15,788 represents the 3' untranslated region; 132 base pairs 
downstream of the polyadenylation signal of the CGICE gene are multiple ESTs, 
representing the 3'- untranslated region of the ferritin heavy chain gene (FTH). FTH 
has been mapped to human chromosome 1 lql3 (Hentze et al., 1986, Proc. Nat. Acad. 
Sci 83: 7226-7230); the FTH gene was later shown to be a part of the smallest 
minimum genetic region containing the BMD gene, as determined by recombination 
breakpoint mapping in a 12 generation Swedish pedigree (Graff et al 1997, Hum. 

Genet. 101: 263-279). 

Figure 2 shows the complete sequence of the short form of human 
CGICE cDNA (SEQ.ID.NO.:2). The ATG start codon is at position 105; the TAA 

stop codon is at position 1,860. 

Figure 3 shows the complete amino acid sequence of the long form of 
human CGICE protein (SEQ.ID.NO.:3). This long form of the human CGICE 
protein is produced by translation of the short form of CGICE cDNA. 
1 5 Figure 4 shows the complete sequence of the long form of human 

CGICE cDNA (SEQ.ID.NO.:4). This long form of the human CGICE cDNA is 
produced when an alternative splice donor site is utilized in intron 7. The ATG start 
codon is at position 105; the TGA stop codon is at position 1410. 

Figure 5 shows the complete amino acid sequence of the short form of 
20 the human CGICE protein (SEQ.ID:N0.:5). This short form of the human CGICE 
orotein is produced by translation of the long form of CGICE cDNA. 

Figure 6 shows the results of sequencing runs of PCR fragments that 
represent exon 4 and adjacent intronic regions from three individuals from the 
Swedish pedigree SI, two of whom are affected with BMD. From top to bottom, the 
25 runs are: patient Sl-5 (homozygous affected with BMD), sense orientation; patient 
Sl-4 (heteroozygous affected with BMD), sense orientation; patient S 1-3 (normal 
control, unaffected sister of Sl-4), sense orientation; patient Sl-5 (affected with 
BMD), anti-sense orientation; patient Sl-4 (affected with BMD), anti-sense 
orientation; patient Sl-3 (normal control), anti-sense orientation. Reading from left to 
30 right, the mutation shows up at position 31 of the sequence shown in the case of 
patients Sl-5 and Sl-4. The mutation in family SI changes tryptophan to cysteine. 

Figure 7 shows a multiple sequence alignment of human CGICE 
protein with partial sequences of related proteins from C. elegans. Related proteins 
from C. elegans were identified by BLASTP analysis of non-redundant GenBank 
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database. This figure shows that two amino acids mutated in two different Swedish 
famines with BMD (families SI and SL76) are evolutionarily conserved. 15 of 16 
related proteins from C. elegans contain a tryptophan at the position of the mutation 
in family SI, as does the wild-type CGICE gene. Only one C. elegans protein does 

5 not have a tryptophan at the position of the mutation. In this protein (accession 
number p34577), tryptophan is changed for isofunctional phenylalanine 
(phenylalanine is highly similar to tryptophan in that it also is a hydrophobic aromatic 
amino acid). Mutation in the BMD family SL76 changes a tyrosine to histidine. 
Again, all 16 related proteins from C. elegans contain tyrosine or isofunctional 

10 phenylalanine in this position (tyrosine is highly similar to phenylalanine in that it 

also is an aromatic amino acid). 

Figure 8A-C shows the complete sequence of mouse CGICE cDNA 
(SEQ.ID.NO.:28) and mouse CGICE protein (SEQ.ID.NO.:29). 

Figure 9A-B shows an alignment of the amino acid sequences of the 
15 long form of human CGICE protein (SEQ.ID.NO.:3) and mouse CGICE protein 
(SEQ.ID.NO.:29). In this figure, CGICE is referred to as "bestrophin." 

Figure lOA-C shows the results of in situ hybridization experiments 
demonstrating that mouse CGICE mRNA expression is localized to the retinal 
pigmented epithelium cells (RPE). Figure lOA shows the results of using an 
20 antisense CG ICE probe. The antisense probe hybridizes to mouse CG ICE mRNA 
present in the various cell layers of the retina, labeling with dark bands the cells 
containing CGICE mRNA. The antisense probe strongly hybridized to the RPE cells 
and not to the cells of the other layers of the retina. Figure lOB shows the results 
using a sense CGICE probe as a control. The sense probe does not hybridize to 
25 CGICE mRNA and does not label the RPE cells. Figure IOC is a higher 

magnification of the RPE cells from Figure lOA. Human CGICE mRNA shows a 
similar distribution, being confined to the RPE cells of the human retina. 

DETAILED DESCRIPTION OF THE INVENTION 
30 For the purposes of this invention: 

"Substantially free from other proteins" means at least 90%, preferably 
95%, more preferably 99%, and even more preferably 99.9%, free of other proteins. 
Thus, a CGICE protein preparation that is substantially free from other proteins will 
contain, as a percent of its total protein, no more than 10%, preferably no more than 
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5%, more preferably no more than 1%, and even more preferably no more than 0.1%, 
of non- CGICE proteins. Whether a given CGICE protein preparation is 
substantially free from other proteins can be determined by such conventional 
techniques of assessing protein purity as, e.g., sodium dodecyl sulfate polyacrylamide 
5 gel electrophoresis (SDS-PAGE) combined with appropriate detection methods, e.g., 
silver staining or immunoblotting. 

"Substantially free from other nucleic acids" means at least 90%, 
preferably 95%, more preferably 99%, and even more preferably 99.9%, free of other 
nucleic acids. Thus, a CGICE DNA preparation that is substantially free from other 
10 nucleic acids will contain, as a percent of its total nucleic acid, no more than 10%, 
preferably no more than 5%, more preferably no more than 1%, and even more 
preferably no more than 0.1%, of non- CGICE nucleic acids. Whether a given 
CGICE DNA preparation is substantially free from other nucleic acids can be 
determined by such conventional techniques of assessing nucleic acid purity as, e.g., 
1 5 agarose gel electrophoresis combined with appropriate staining methods, e.g. , 
ethidium bromide staining, or by sequencing. 

A "conservative amino acid substitution" refers to the replacement of 
one amino acid residue by another, chemically similar, amino acid residue. Examples 
of such conservative substitutions are: substitution of one hydrophobic residue 
20 (isoleucine, leucine, valine, or methionine) for another; substitution of one polar 
residue for another polar residue of the same charge (e.g., arginine for lysine; 
glutamic acid for aspartic acid); substitution of one aromatic amino acid (tryptophan,, 
tyrosine, or phenylalanine) for another. 

The present invention relates to the identification and cloning of 
25 CGICE, a gene which, when mutated, is responsible for Best's macular dystrophy. 
That CGICE is the Best's macular dystrophy gene is supported by various 
observations: 

1 . CG 1 CE maps to the genetically defined region of human 
chromosome 1 Iql2-ql3 that has been shown to contain the Best's macular dystrophy 

30 gene. CGICE is present on two PAC^ones, 759J12and466All, that lie precisely in 
the most narrowly defined region that has been shown to contain CGICE (Cooper et 
al, 1997, Genomics 41: 185-192; Stohr et al, 1997, Genome Res. 8:48-56; Graff et al 

., 1997, Hum. Genet. 101: 263-279). 

2. CG ICE is expressed predominately in the retma. 
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3. In patients having Best's macular dystrophy, CGICE contains 
mutations in evolutionarily conserved amino acids. 

4. The CGICE genomic clones contain another gene (FTH) that 
has been physically associated with the Best's macular dystrophy region (Cooper et 

5 al, 1997.Genomics41:185-192;St6hre^a/., 1997, Genome Res. 8:48-56; Graff 
al, 1997, Hum. Genet. 101:263-279). The FTH and CGICE genes are oriented tail- 
to-tail; the distance between their polyadenylation signals is 132 bp. 

The present invention provides DNA encoding CGICE that is 
substantially free from other nucleic acids. The present invention also provides 
10 recombinant DNA molecules encoding CGICE. The present invention provides 

DNA molecules substantially free from other nucleic acids comprising the nucleotide 
sequence shown in Figure 1 as SEQ.ID.NO.:!. Analysis of SEQ.ID.NO.:l revealed 
that this genomic sequence defines a gene having 1 1 exons. These exons collectively 
have an open reading frame that encodes a protein of 585 amino acids. If an 
15 alternative splice donor site is utilized in exon 7, a cDNA containing an additional 
203 bases is produced. Although longer, this cDNA contains a shorter open reading 
frame of 1,305 bases (due to the presence of a change in reading frame that introduces 
a stop codon) that encodes a protein of 435 amino acids. Thus, the present invention 
includes two cDNA molecules encoding two forms of CGICE protein that are 
20 substantially free from other nucleic acids and have the nucleotide sequences shown 
in Figure 2 as SEQ.ID.NO.:2 and in Figure 4 as SEQ.ID.NO.:4. 

The present invention includes DNA molecules substantially free from 
other nucleic acids comprising the coding regions of SEQ.ID.NO. :2 and 
SEQ.ID.NO.:4. Accordingly, the present invention includes DNA molecules 
25 substantially free from other nucleic acids having a sequence comprising positions 
105-1,859 of SEQ.ID.NO.:2 and positions 105-1,409 of SEQ.ID.NO.:4. Also 
included are recombinant DNA molecules having a nucleotide sequence comprising 
positions 105-1,859 of SEQ.ID.NO.:2 and positions 105-1,409 of SEQ.ID.NO.:4. 

Portions of the cDNA sequences of SEQ.ID.NO.:2 and SEQ.ID.NO.:4 
30 are found in two retina-specific ESTs deposited in GenBank by The Institute for 
Genomic Research (accession numbers AA318352 and AA317489). Other ESTSs 
that correspond to this cDNA are accession numbers AA3071 19 (from a colon 
carcinoma), AA205892 (from neuronal cell line), and AA326727 (from human 
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cerebellum). A true mouse ortholog of the CGICE gene is represented in the mouse 
EST AA497726 (from mouse testis). 

The novel DNA sequences of the present invention encoding CGICE, 
in whole or in part, can be linked with other DNA sequences, i.e., DNA sequences to 
5 which CGICE is not naturally linked, to form "recombinant DNA molecules- 
encoding CGICE. Such other sequences can include DNA sequences that control 
transcription or translation such as, e.g., translation initiation sequences, promoters 
for RNA polymerase H, transcription or translation termination sequences, enhancer 
sequences, sequences that control replication in microorganisms, sequences that 
10 confer antibiotic resistance, or sequences that encode a polypeptide "tag" such as, 
e.g., a polyhistidine tract or the myc epitope. The novel DNA sequences of the 
present invention can be inserted into vectors such as plasmids, cosmids, viral 
vectors, PI artificial chromosomes, or yeast artificial chromosomes. 

Included in the present invention are DNA sequences that hybridize to 
15 at least one of SEQ.ID.NOs.: 1,2, or 4 under stringent conditions. By way of 

example, and not limitation, a procedure using conditions of high stringency is as 
follows: Prehybridization of filters containing DNA is carried out for 2 hr. to 
overnight at 65°C in buffer composed of 6X SSC. 5X Denhardt's solution, and 100 
/ig/ml denatured salmon sperm DNA. Filters are hybridized for 12 to 48 hrs at 65°C 
20 in prehybridization mixture containing 100 Mg/ml denatured salmon sperm DNA and 
5-20 X 106 cpm of 32p-iabeled probe. Washing of filters is done at 37°C for 1 hr m a 
solution containing 2X SSC, 0.1% SDS. This is followed by a wash in O.IX SSC, 
0. 1% SDS at 50°C for 45 min. before autoradiography. 

Other procedures using conditions of high stringency would include 
25 either a hybridization carried out in 5XSSC, 5X Denhardt's solution, 50% formamide 
at 42°C for 12 to 48 hours or a washing step carried out in 0.2X SSPE, 0.2% SDS at 

65°C for 30 to 60 minutes. 

Reagents mentioned in the foregoing procedures for carrying out high 
stringency hybridization are well known in the art. Details of the composition of 
30 these reagents can be found in, e.g., Sambrook, Fritsch, and Maniatis, 1989, 

Molecular Cloni n p- A T ahnratorv Manual , second edition, Cold Spring Harbor 
Laboratory Press. In addition to the foregoing, other conditions of high stringency 
which may be used are well known in the art. 
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The degeneracy of the genetic code is such that, for all but two amino 
acids, more than a single codon encodes a particular amino acid. This allows for the 
construction of synthetic DNA that encodes the CGICE protein where the nucleot.de 
sequence of the synthetic DNA differs significantly from the nucleotide sequences of 
SEQ ID NOs -2 or 4, but still encodes the same CG ICE protein as SEQ.ID.NOs.:2 or 
4 Such synthetic DNAs are intended to be within the scope of the present invention. 

Mutated forms of SEQ.lD.NOs.: 1, 2, or 4 are intended to be within the 
scope of the present invention. In particular, mutated forms of SEQ.lD.NOs.: 1, 2, or 
4 which give rise to Best's macular dystrophy are within the scope of the present 
invention. Accordingly, the present invention includes a DNA molecule having a 
nucleotide sequence that is identical to SEQ.rD.NO.:l except that the nucleotide at 
position 7 259 of SEQ.ID.NO.: 1 is T, A, or C rather than G, so that the codon at 
positions 7,257-7,259 encodes either cysteine or is a stop codon rather than encoding 
tryptophan. Also included in the present invention is a DNA molecule having a 
nucleotide sequence that is identical to SEQ.ID.NO.: 1 except that at least one of the 
nucleotides at position 7,257 or 7,258 has been changed so that the codon at positions 
7 257-7 259 does not encode tryptophan. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,859 of SEQ.ID.NO.:2 except that the 
nucleotide at position 383 is T, A, or C rather than G, so that the codon at positions 
381-383 encodes either cysteine or is a stop codon rather than encoding tryptophan. 
Also included in the present invention is a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,859 of SEQ.ID.NO.:2 except that at least 
one of the nucleotides at position 38 1 or 382 has been changed so that the codon at 
25 positions 38 1-383 does not encode tryptophan. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,409 of SEQ.ID.NO.:4 except that the 
nucleotide at position 383 is T, A, or C rather than G, so that the codon at positions 
381-383 encodes either cysteine or is a stop codon rather than encoding tryptophan. 
Also included in the present invention is a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,409 of SEQ.ID.NO.:4 except that at least 
one of the nucleotides at position 381 or 382 has been changed so that the codon at 
positions 381-383 does not encode tryptophan. 
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The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to SEQ.ID.NO.: 1 except that the nucleotide at position 
7 233 of SEQ.1D.no.: 1 is C, A, or G rather than T, so that the codon at positions 
7 233-7 235 does not encode tyrosine. Also included in the present mvenfon .s a 
DNA molecule having a nucleotide sequence that is identical to SEQ.m.NO.. 1 except 
that at least one of the nucleotides at position 7,234 or 7,235 has been changed so that 
the codon at positions 7,233-7,235 does not encode tyrosine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,859 of SEQ.ID.NO.:2 except that the 
nucleotide at position 357 is C, A, or G rather than T, so that the codon at positions 
357-359 does not encode tyrosine. Also included in the present invention is a DNA 
molecule having a nucleotide sequence that is identical to positions 105-1,859 of 
SEQ ID NO -2 except that at least one of the nucleotides at position 358 or 359 has 
been changed so that the codon at positions 357-359 does not encode tyrosine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,409 of SEQ.ID.NO.:4 except that the 
nucleotide at position 357 is C. A. or G rather than T, so that the codon at positions 
357-359 does not encode tyrosine. Also included in the present invention is a DNA 
molecule having a nucleotide sequence that is identical to positions 105-1,409 of 
SEQ ID NO -4 except that at least one of the nucleotides at position 358 or 359 has 
been changed so that the codon at positions 357-359 does not encode tyrosine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to SEQ.ID.NO.: 1 except that the nucleotide at position 
3 330 is C rather than A. Also included in the present invention is a DNA molecule 
having a nucleotide sequence that is identical to SEQ.ID.NO.: 1 except that the 
nucleotide at position 3,330 of SEQ.ID.NO.: 1 is G, C, or T rather than A, so that the 
codon at positions 3,330-3,332 does not encode threonine. Also included in the 
present invention is a DNA molecule having a nucleotide sequence that is ident^a to 
SEQ ID NO • 1 except that at least one of the nucleotides at position 3,330 or 3,33 
has been changed so that the codon at positions 3.330-3,332 does not encode 



30 

threonine 



The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,859 of SEQ.ID.NO.:2 except that the 
nucleotide at position 120 is C rather than A. Also included in the present invention 
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is a DNA molecule having a nucleotide sequence that is identical to pos.tions 105- 
1 859 of SEQ ID NO.:2 except that the nucleotide at position 120 is G, C, or T rather 
than A, so that the codon at positions 120-122 does not encode threonine. Also 
included in the present invention is a DNA molecule having a nucleotide sequence 
that is Identical to positions 105-1,859 of SEQ.ID.NO.:2 except that at least one of the 
nucleotides at position 120 or 121 has been changed so that the codon at pos.tions 
120-122 does not encode threonine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,409 of SEQ.lD.NO.:4 except that the 
nucleotide at position 120 is C rather than A. Also included in the present mvention 
is a DNA molecule having a nucleotide sequence that is identical to positions 105- 
1 409 of SEQ ID.NO.:4 except that the nucleotide at position 120 is G, C, or T rather 
than A, so that the codon at positions 120-122 does not encode threonine. Also 
included in the present invention is a DNA molecule having a nucleotide sequence 
that IS identical to positions 105-1,409 of SEQ.ID.NO.:4 except that at least one of the 
nucleotides at position 120 or 121 has been changed so that the codon at positions 

1 20- 122 does not encode threonine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to SEQ.ID.NO.: 1 except that the nucleotide at position 
8 939 is A rather than T. Also included in the present invention is a DNA molecule 
having a nucleotide sequence that is identical to SEQ.ID.NO.: 1 except that the 
nucleotide at position 8,939 of SEQ.ID.NO.: 1 is A, G, or C, rather than T, so that the 
codon at positions 8,939-8,941 does not encode tyrosine. Also included in the present 
invention is a DNA molecule having a nucleotide sequence that is ic^enncaU^^^ 
SEQ ID NO • 1 except that at least one of the nucleotides at position 8,939-8.941 has 
been changed so that the codon at positions 8,939-8,941 does not encode tyrosine^ 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1.859 of SEQ.ID.NO.:2 except that the 
nucleotide at position 783 is A rather than T. Also included in the present invention 
is a DNA molecule having a nucleotide sequence that is identical to positions 105- 
1 859 of SEQ.ID.NO.:2 except that the nucleotide at position 783 is A, G, or C rather 
than T so that the codon at positions 783-785 does not encode tyrosine. Also included 
in the present invention is a DNA molecule having a nucleotide sequence that is 
identical to positions 105-1,859 of SEQ.ID.NO.:2 except that at least one of the 
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nucleotides at position 783-785 has been changed so that the codon at positions 783- 

785 does not encode tyrosine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,409 of SEQ.ID.NO.:4 except that the 
nucleotide at position 783 is A rather than T. Also included in the present invention 
is a DNA molecule having a nucleotide sequence that is identical to positions 105- 
1 409 of SEQ ID.NO.:4 except that the nucleotide at position 783 is A, G, or C rather 
than T, so that the codon at positions 783-785 does not encode tyrosine. Also 
included in the present invention is a DNA molecule having a nucleotide sequence 
that is identical to positions 105-1.409 of SEQ.ID.NO.:4 except that at least one of the 
nucleotides at position 783-785 has been changed so that the codon at positions 783- 
785 does not encode tyrosine. 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to SEQ.ID.NO.: 1 except that the nucleotide at position 
1 1 241 is A rather than G. Also included in the present invention is a DNA molecule 
having a nucleotide sequence that is identical to SEQ.ID.NO.: 1 except that the 
nucleotide at position 1 1,241 is A, C, or T, rather than G, so that the codon at 
positions 1 1,240-11,242 does not encode glycine. Also included in the present 
invention is a DNA molecule having a nucleotide sequence that is identical to 
SEQ ID NO • 1 except that at least one of the nucleotides at position 1 1,240 or 1 1,241 
has been changed so that the codon at positions 1 1 ,240- 1 1 ,242 does not encode 

The present invention includes a DNA molecule having a nucleotide 
sequence that is identical to positions 105-1,859 of SEQ.ID.NO.:2 except that the 
nucleotide at position 1.000 is A rather than G. Also included in the present in vent,on 
is a DNA molecule having a nucleotide sequence that is identical to positions 105- 
1 859 of SEQ ID NO.:2 except that the nucleotide at position 1,000 is A. C. or T 
rather than G. so that the codon at positions 999-1.001 does not encode glycine. Also 
included in the present invention is a DNA molecule having a nucleotide sequence 
that is identical to positions 105-1,859 of SEQ.ID.NO.:2 except that at least one of the 
nucleotides at position 999 or 1,000 has been changed so that the codon at positions 

999- 1 ,00 1 does not encode glycine. 

Another aspect of the present invention includes host cells that have 
been engineered to contain and/or express DNA sequences encoding CGICE protein. 
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Such recombinant host cells can be cultured under suitable conditions to produce 
CGICE protein. An expression vector containing DNA encoding CGICE protem can 
be used for expression of CGICE protein in a recombinant host cell. Recombmant 
host cells may be prokaryotic or eukaryotic, mcluding but not limited to, bactena such 
as E coli fungal cells such as yeast, mammalian cells including, but not limited to. 
cell lines of human, bovine, porcine, monkey and rodent origin, and insect cells 
including but not limited to Drosophila and silkworm derived cell lines. Cell lines 
derived from mammalian species which are suitable for recombinant expression of 
CGICE protein and which are commercially available, include but are not limited to, 
L cells L-M(TK-) (ATCC CCL 1.3), L cells L-M (ATCC CCL 1.2), 293 (ATCC CRL 
1573) Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), 
COS-7 (ATCC CRL 1651), CHO-Kl (ATCC CCL 61), 3T3 (ATCC CCL 92), 
NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS- 
C-1 (ATCC CCL 26) and MRC-5 (ATCC CCL 171). 

A variety of mammalian expression vectors can be used to express 
recombinant CGICE in mammalian cells. Commercially available mammalian 
expression vectors which are suitable include, but are not limited to, pMClneo 
(Stratagene), pSG5 (Stratagene), pcDNAI and pcDNAIamp, pcDNA3, pcDNA3.1 
PCR3.1 (invitrogen), EBO.pSV2-neo (ATCC 37593), pBPV-l(8-2) (ATCC 371 10), 
20 pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo 
(ATCC 37 198), and pSV2-dhfr (ATCC 37 146). Following expression in 
recombinant cells, CGICE can be purified by conventional techniques to a level that 

is substantially free from other proteins. 

The present invention includes CGICE protein substantially free from 
25 other proteins. The amino acid sequence of the full-length CGICE protein is shown 
in Figure 3 as SEQ.ID.NO.:3. Thus, the present invention includes CGICE protein 
substantially free from other proteins having the amino acid sequence SEQ.ID.NO.:3. 
Also included in the present invention is a CGICE protein that is produced from an 
alternatively spliced CGICE mRNA where the protein has the amino acid sequence 
30 shown in Figure 5 as SEQ.ID.no. :5. 

Mutated forms of CG ICE proteins are intended to be within the scope 
of the present invention. In particular, mutated forms of SEQ.ID.NOs.:3 and 5 that 
give rise to Best's macular dystrophy are within the scope of the present invention. 
Accordingly, the present invention includes a protein having the amino acid sequence 
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shown in Figure 3 as SEQ.ID.NO.:3 except that the amino acid at position 93 is 
cysteine rather than tryptophan. The present invention also includes a protein having 
the amino acid sequence shown in Figure 5 as SEQ.ID.NO.:5 except that the ammo 
acid at position 93 is cysteine rather than tryptophan. The present invention mcludes 
a protein having the amino acid sequence shown in Figure 3 as SEQ.ID.NO.:3 except 
that the amino acid at position 93 is not tryptophan. The present invention also 
includes a protein having the amino acid sequence shown in Figure 5 as 
SEQ.ID.NO.:5 except that the amino acid at position 93 is not tryptophan. 

The present invention includes a protein having the amino acid 
sequence shown in Figure 3 as SEQ.ID.NO.:3 except that the amino acid at position 
85 is histidine rather than tyrosine. The present invention also includes a protem 
having the amino acid sequence shown in Figure 5 as SEQ.ID.NO.:5 except that the 
amino acid at position 85 is histidine rather than tyrosine. The present invention 
includes a protein having the amino acid sequence shown in Figure 3 as 
SEQ.ID.NO.:3 except that the amino acid at position 85 is not tyrosine. The present 
invention also includes a protein having the amino acid sequence shown in Figure 5 
as SEQ.ID.NO.:5 except that the amino acid at position 85 is not tyrosine. 

The present invention includes a protein having the amino acid 
sequence shown in Figure 3 as SEQ.ID.NO.:3 except that the amino acid at position 6 
is proline rather than threonine. The present invention also includes a protein having 
the amino acid sequence shown in Figure 5 as SEQ.ID.NO.:5 except that the amino 
acid at position 6 is proline rather than threonine. The present invention includes a, 
protein having the amino acid sequence shown in Figure 3 as SEQ.ID.NO.:3 except 
. that the amino acid at position 6 is not threonine. The present invention also includes 
a protein having the amino acid sequence shown in Figure 5 as SEQ.ID.NO.:5 except 
that the amino acid at position 6 is not threonine. 

The present invention includes a protein having the amino acid 
sequence shown in Figure 3 as SEQ.ID.NO.:3 except that the amino acid at position 
227 is asparagine rather than tyrosine. The present invention also includes a protein 
having the amino acid sequence shown in Figure 5 as SEQ.ID.NO.:5 except that the 
amino acid at position 227 is asparagine rather than tyrosine. The present invention 
includes a protein having the amino acid sequence shown in Figure 3 as 
SEQ.ID.NO.:3 except that the amino acid at position 227 is not tyrosine. The present 
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invention also includes a protein having the amino acid sequence shown in Figure 5 
as SEQ.ID.NO.:5 except that the amino acid at position 227 is not tyrosine. 

The present invention includes a protein having the amino acid 
sequence shown in Figure 3 as SEQ.ID.NO.:3 except that the amino acid at position 
5 299 is glutamate rather than glycine. The present invention includes a protem having 
the amino acid sequence shown in Figure 3 as SEQ.ID.NO.:3 except that the amino 
acid at position 299 is not glycine. As with many proteins, it is possible to modify 
many of the amino acids of CGICE and still retain substantially the same biological 
activity as the original protein. Thus, the present invention includes modified CGICE 
10 proteins which have amino acid deletions, additions, or substitutions but that still 
retain substantially the same biological activity as CGICE. It is generally accepted 
that single amino acid substitutions do not usually alter the biological activity of a 
protein (see, e.g., ^/r^i^^-i.. Rinlopv of the Gene, Watson et al, 1987, Fourth Ed., 
The Benjamin/Cummings Publishing Co., Inc., page 226; and Cunningham & Wells, 
15 1989, Science 244: 108 1- 1085). Accordingly, the present invention includes 

polypeptides where one amino acid substitution has been made in SEQ.ID.NOs.:3 or 
5 wherein the polypeptides still retain substantially the same biological activity as 
CGICE The present invention also includes polypeptides where two amino acid 
substitutions have been made in SEQ.ID.NOs.:3 or 5 wherein the polypeptides still 
20 retain substantially the same biological activity as CGICE. In particular, the present 
invention includes embodiments where the above-described substitutions are 
conservative substitutions. In particular, the present invention includes embodiments 
where the above-described substitutions do not occur in positions where the ammo 
acid present in CGICE is also present in one of the C. elegans proteins whose partial 

25 sequence is shown in Figure 7. 

The CGICE proteins of the present invention may contain post- 

translational modifications, e.g., covalently linked carbohydrate. 

The present invention also includes chimeric CGICE proteins. 
Chimeric CGICE proteins consist of a contiguous polypeptide sequence of at least a 
30 portion of a CGICE protein fused to a polypeptide sequence of a non- CGICE 
protein. 

The present invention also includes isolated forms of CGICE proteins 
and CGICE DNA. By "isolated CGICE protein" or "isolated CGICE DNA" is 
meant CGICE protein or DNA encoding CGICE protein that has been isolated from 
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a natural source. Use of the term "isolated" indicates that CGICE protein or CGICE 
DNA has been removed from its normal cellular environment. Thus, an isolated 
CGICE protein may be in a cell-free solution or placed in a different cellular 
environment from that in which it occurs naturally. The term isolated does not imply 
5 that an isolated CGICE protein is the only protein present, but instead means that an 
isolated CGICE protein is at least 95% free of non-amino acid material (e.g., nucleic 
acids, lipids, carbohydrates) naturally associated with the CGICE protein. Thus, a 
CGICE protein that is expressed in bacteria or even in eukaryotic cells which do not 
naturally (i.e., without human intervention) express it through recombinant means is 
10 an "isolated CGICE protein." 

A cDNA fragment encoding full-length CGICE can be isolated from a 
human retinal cell cDNA library by using the polymerase chain reaction (PCR) 
employing suitable primer pairs. Such primer pairs can be selected based upon the 
cDNA sequence for CGICE shown in Figure 2 as SEQ.ID.NO.:2 or in Figure 4 as 
15 SEQ.ID.NO.:4. Suitable primer pairs would be, e.g.: 

CAGGGAGTCCCACCAGCC (SEQ.ID.NO.:6) and 
TCCCCATTAGGAAGCAGG (SEQ.ID.NO.:7) 
for SEQ.ID.NO.:2; and 

CAGGGAGTCCCACCAGCC (SEQ.ID.NO.:6) and 
20 TCTCCTCTTTGTTCAGGC (SEQ.ID.NO.:8) 

for SEQ.ID.no. :4. 

PCR reactions can be carried out with a variety of thermostable 
enzymes including but not limited to AmpliTaq, AmpliTaq Gold, or Vent polymerase. 
For AmpliTaq, reactions can be carried out in 10 mM Tris-Cl, pH 8.3, 2.0 mM 

25 MgCl2, 200 ^M for each dNTP, 50 mM KCl, 0.2 fiM for each primer, 10 ng of DNA 
template, 0.05 units//xl of AmpliTaq. The reactions are heated at 95°C for 3 minutes 
and then cycled 35 times using the cycling parameters of 95°C, 20 seconds, 62°C, 20 
seconds, 72°C, 3 minutes. In addition to these conditions, a variety of suitable PCR 
protocols can be found in PCR Primer. A T ^horato r y Manual, edited by C.W. 

30 Dieffenbach and G.S. Dveksler, 1995, Cold Spring Harbor Laboratory Press; or PCR 
Protocols: A Gnide to Method ^ nnri Applications. Michael et al., eds., 1990, 
Academic Press . 

A suitable cDNA library from which a clone encoding CGICE can be 
isolated would be Human Retina 5'-stretch cDNA library in lambda gtlO or lambda 
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gtl 1 vectors (catalog numbers HLl 143a and HLl 132b, Clontech, Palo Alto, CA). 
The primary clones of such a library can be subdivided into pools with each pool 
containing approximately 20,000 clones and each pool can be amplified separately. 

By this method, a cDNA fragment encoding an open reading frame of 
585 amino acids (SEQ.ID.NO.:3) or an open reading frame of 435 amino acids 
(SEQ ID NO.:5) can be obtained. This cDNA fragment can be cloned into a suitable 
cloning vector or expression vector. For example, the fragment can be cloned into the 
mammalian expression vector pcDNA3.1 (Invitrogen, San Diego, Ca). CGICE 
protein can then be produced by transferring an expression vector encoding CGICE 
or portions thereof into a suitable host cell and growing the host cell under 
appropriate conditions. CGICE protein can then be isolated by methods well known 
in the art. 

As an alternative to the above-described PCR method, a cDNA clone 
encoding CGICE can be isolated from a cDNA library using as a probe 
oligonucleotides specific for CGICE and methods well known in the art for screenmg 
cDNA libraries with oligonucleotide probes. Such methods are described in, e.g., 
Sambrook et al, 1989, Molecular Cloning: A Laboratory Manual; Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York; Glover, D.M. (ed.), 1985, DNA 
Cloning- A Practical Approach, MRL Press, Ltd., Oxford, U.K., Vol. I, H. 
Oligonucleotides that are specific for CGICE and that can be used to screen cDNA 
libraries can be readily designed based upon the cDNA sequence of CGICE shown m 
Figure 2 as SEQ.ID.NO.:2 or in Figure 4 as SEQ.ID.NO.:4 and can be synthesized by 

methods well-known in the art. 

Genomic clones containing the CGICE gene can be obtained from 
commercially available human PAC or BAC libraries available from Research 
Genetics, Huntsville, AL. PAC clones containing the CGICE gene {e.g.. PAC 
759J12, PAC 466A1 1) are commercially available from Research Genetics, 
Huntsville, AL (Catalog number for individual PAC clones is RPCI.C). 
Alternatively, one may prepare genomic libraries, especially in PI artificial 
chromosome vectors, from which genomic clones containing the CGICE can be 
isolated, using probes based upon the CGICE sequences disclosed herein. Methods 
of preparing such libraries are known in the art (loannou et a/.,1994. Nature Genet. 
6:84-89). 
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The novel DNA sequences of the present invention can be used in 
various diagnostic methods relating to Best's macular dystrophy. The present 
invention provides diagnostic methods for determining whether a patient carries a 
mutation in the CGICE gene that predisposes that patient toward the development of 
5 Best's macular dystrophy. In broad terms, such methods comprise determmmg the 
DNA sequence of a region of the CG ICE gene from the patient and companng that 
sequence to the sequence from the corresponding region of the GGIGE gene from a 
normal person, a person who does not suffer from Best's macular dystrophy. 

Such methods of diagnosis may be carried out in a variety of ways. 

10 For example, one embodiment comprises: 

(a) providing PGR primers from a region of the GG IGE gene 
where it is suspected that a patient harbors a mutation in the GG IGE gene; 

(b) performing PGR on a DNA sample from the patient to produce 

a PGR fragment from the patient; 
j5 (c) performing PGR on a control DNA sample having a nucleotide 

sequence selected from the group consisting of SEQ.ID.NOs.: 1. 2 and SEQ.ID.NO.:4 

to produce a control PGR fragment; 

(d) determining the nucleotide sequence of the PGR fragment from 

the patient and the nucleotide sequence of the control PGR fragment; 
20 (e) comparing the nucleotide sequence of the PGR fragment from 

the patient to the nucleotide sequence of the control PGR fragment; 

where a difference between the nucleotide sequence of the PGR 
fragment from the patient and the nucleotide sequence of the control PGR fragment 
indicates that the patient has a mutation in the GGIGE gene. 

In a particular embodiment, the PGR primers are from the coding 
region of the GGIGE gene, from the coding region of SEQ.ID.NOs.: 1, 2, or 4^ 

In a particular embodiment, the DNA sample from the patient is cDNA 
that has been prepared from an RNA sample from the patient. In another 
embodiment, the DNA sample from the patient is genomic DNA. 

In a particular embodiment, the nucleotide sequences of the PGR 
fragment from the patient and the control PGR fragment are determined by DNA 
sequencing. 

In a particular embodiment, the nucleotide sequences of the PGR 
fragment from the patient and the control PGR fragment are compared by direct 
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comparison after DN A sequencing. In ancher embodimen., the companson ,s made 
Z a process .ha. includes hybr,dizing .he PGR fragmen. from ,he pa.,en. and .h 

n.™, PCR fragmen. and .hen using an endonuciease .ha. Ceaves a. -™ 
p„s,.,ons in .he hybrid bu. does no. Oeave .he hybrid if .he .wo fragmenfs ma.ch 
5 p rfectly such an endonuciease is. e.,.. SI. In .his embodimen., ,he convers,on of 
mfSfmgmen. fron, .he pa,ien. .o smaller fragmen.s after endonuciease .rea.mem 
indicates .ha. .he pa.ien. carries a mu,a<.on in .he CO ICE gene. In such 
embodimems, i. may be advaniageous ,o label (radioacively. -^yn>a..c^'y|^ 
immunologically. «c.) .he PCR fragmen. from .he pa.ieM or .he con.rol PCR 

10 fn.gmen.. ^^.^^^^^^ , „e.hod of diagnosing wheiher a 

pa.ien. cames a mu.a.ion in .he CGICE gene .ha. comprises: 

(a) obuiining an RNA sample from the pauen.; 

(b) perfonn,ng reverse .ranscrip.ion-PCR (RT-PCR) on .he RNA 
15 sample using primers .ha. span a region of .he coding sequence of dre COICE gene .o 

price a PCR fragmen. from .he pa.ien, where .he PCR fragment from .he pa.,en. 
Cs a ^fined length, .he leng.h being dependent upon the .den.ity of the pr,mers .ha. 

were used in the RT-PCR; 

(c) hybridizing the PCR fragment to DNA having a sequence 

, „ cFo TD NOs 1 2 and SEQ.ID.NO.:4 to form a 
20 selected from the group consistmg of SEQ.lD.MUs.. i, z an 

' (d) treating the hybrid produced in step (c) with an endonuciease 

that cleaves at any mismatched positions in the hybrid but does not cleave the hybnd 

if the two fragments match perfectly; ^ - . u 

(e) determining whether the endonuciease cleaved the hybnd by 

detenmmng the length of the PCR fragment from the pat.ent after ^^^^^^^^ 
treatment where a reduction m the length of the PCR fragment ^^^"^ ^^^^^^^^^^^^^ 
endonuciease treatment indicates that the patient carries a mutation m the CGICE 

30 The present invention provides a method of diagnosing whether a 

patient carries a mutation in the CG ICE gene that comprises: 

(a) making cDNA from an RNA sample from the patient; 

(b) providing a set of PCR primers based upon SEQ.ID.NO.:2 or 

SEQ.ID.NO.:4; 
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(c) performing PGR on the cDNA to produce a PGR fragment 
from the patient, ^ ^^^^^^^^ ^^^^^^^^^^ ^^^^^^^^ 

(e) comparing the nucleotide sequence of the PGR fragment from 
the pat.ent with the nucleotide sequence of SEQ.ID.NO.:2 or SEQmNO„4; 

where a difference between the nucleotide sequence of the PGR 
fragment from the pat.ent wUh the nucleot.de sequence of SEQ-ID.N0.2 or 
SEO ID NO.:4 indicates that the patient carries a mutation m the GGIGE gene. 

The present invention provides a method of diagnosing whether a 
patient carries a mutation in the GGIGE gene that comprises: 

(a) preparing genomic DNA from the patient; 

(b) providing a set of PGR primers based upon SEQ.ID.NO.:!, 

SEQ.ID.NO.:2,orSEQ.ID.NO.:4; 

(c) performing PGR on the genomic DNA to produce a PGR 

.agment from the P--^^^^^^^ ^^^^^^^^^^ PC^ fragment from 

(e) comparing the nucleotide sequence of the PGR fragment from 
20 the patient with the nucleotide sequence of SEQ.ro.NO.:2 or 

where a difference between the nucleotide sequence of the PGR 
fragment from the patient with the nucleotide sequence of SEQ^N^^ ^ 
SEO ID NO :4 indicates that the patient carries a mutation in the GGIGE gene. 

In a particular embodiment, the primers are selected so that they 
25 amplify a portion of SEQ.ID.NOs.:2 or 4 that includes at least one ^-^^^^ ^^[^'^^ 
Zthegroupconsistingof.positions 120, 121, 122, 357, 358. 359, 381 2, 383, 
783 784 and 785. In another embodiment, the primers are selected so that they 
amplify portion of SEQ.ID.NOs.:2 or 4 that includes at least one position selected 
rom the group consisting of: positions 384, 385, and 386. In another em odiment, 
30 e pnme!s ar^ selected so that they amplify a portion of SEQ-^NC.^ ^^^^^^^^^^^^^^^ 
at least one position selected from the group consisting of: positions ^99, 1.000 and 
1 001 in another embodiment, the pnmers are selected so that they amplify a poition 
of SEQ ID NOs.:2 or 4 that includes at least one codon that encodes an amino acid 
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present in CGICE that is also present in the corresponding position in at least one of 
the C. elegans proteins whose partial amino acid sequence is shown in Figure 7. 

In a particular embodiment, the present invention provides a diagnostic 
method for determining whether a person carries a mutation of the CGICE gene in 
. which the G at position 383 of SEQ.ID.NO.:2 has been changed to a C. This change 
results in the creation of a Fnu4HI restriction site. By amplifying a PCR fragment 
spanning position 383 of SEQ.ID.NO.:2 from DNA or cDNA prepared from a person, 
digesting the PCR fragment with Fnu4HI, and visualizing the digestion products, e.g., 
by SDS-PAGE, one can easily determine if the person carries the G383C mutation. 
For example, one could use the PCR primer pair S'-CTCCTGCCCAGGCTTCTAC- 
3' (SEQ ID.NO.:30) and 5'-CTTGCTCTGCCTTGCCTTC-3' (SEQ.ID.NO.:31) to 
amplify a 125 base pair fragment. Heterozygotes for the G383C mutation have three 
Fnu4ffl digestion products: 125 bp, 85 bp, and 40 bp; homozygotes have two: 85 bp 
and 40 bp; and wild-type individuals have a single fragment of 125 bp. 

In a particular embodiment, the present invention provides a diagnostic 
method for determining whether a person carries a mutation of the CGICE gene in 
which the T at position 783 of SEQ.ID.NO.:2 has been changed to an A. This change 
results in the creation of a PflMI restriction site. By amplifying a PCR fragment 
spanning position 783 of SEQ.ID.NO.:2 from DNA or cDNA prepared from a person, 
digesting the PCR fragment with PflMI, and visualizing the digestion products, e.g., 
by SDS-PAGE, one can easily determine if the person carries the T783A mutation. 

The present invention also provides oligonucleotide probes, based 
upon the sequences of SEQ.ID.NOs.:l, 2, or 4, that can be used in diagnostic methods 
related to Best's macular dystrophy. In particular, the present invention includes 
DNA oligonucleotides comprising at least 18 contiguous nucleotides of at least one of 
a sequence selected from the group consisting of: SEQ.ID.NOs.: 1, 2 and 
SEQ.ID.:N0.4. Also provided by the present invention are corresponding RNA 
oligonucleotides. The DNA or RNA oligonucleotide probes can be packaged in kits. 

In addition to the diagnostic utilities described above, the present 
invention makes possible the recombinant expression of the CGICE protein in 
various cell types. Such recombinant expression makes possible the study of this 
protein so that its biochemical activity and its role in Best's macular dystrophy can be 
elucidated. 
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The present invention also makes possible the development of assays 
which measure the b.olog.cal activity of the CGICE protein. Such assays usmg 
recombmantly expressed CGICE protein are especially of interest. Assays for 
CGICE protein activity can be used to screen libraries of compounds or other sources 

5 of compounds to identify compounds that are actWators or ,nhibUors o the act^v.ty of 
CGICE protem. Such identified compounds can set^e as "leads" for the development 
of pharmaceuticals that can be used to treat patients having Best's macular dystrophy. 
In versions of the above-described assays, mutant CGICE proteins are used and 
inhibitors or actWators of the activity of the mutant CG ICE protems are discovered. 

10 Such assays comprise: 

(a) recombinantly expressing CGICE protein or mutant CGICE 

protein in a host cell; 

(b) measuring the biological activity of CGICE protem or mutant 

CGICE protein in the presence and in the absence of a substance suspected of bemg 
15 an activator or an inhibitor of CGICE protein or mutant CGICE protem, 

where a change in the biological activity of the CGICE protein or the 
mutant CGICE protein in the presence as compared to the absence of the substance 
indicates that the substance is an activator or an inhibitor of CGICE protem or mutant 

20 The present invention also includes antibodies to the CGICE protein. 

. Such antibodies may be polyclonal antibodies or monoclonal antibodies. The 
antibodies of the present invention are raised against the ent.re CGICE protein or 
against suitable antigenic fragments of the protein that are coupled to suitable earner^, 
' serum albumin or keyhole Hmpet hemocyanin, by methods well known in the art. 
25 Methods of identifying suitable antigenic fragments of a protein are known .n the art. 
see e Hopp & Woods. I98I. Proc. Natl. Acad. Sci. USA 78:3824-3828; and 
Jameson & Wolf, 1988, CABIOS (Computer Applications in the Biosciences) 4:181- 

For the production of polyclonal antibodies, CGICE protein or an 
30 antigenic fragment, coupled to a suitable earner, is injected on a periodic bas.smto an 
appropriate non-human host animal such as, ..g., rabbits, sheep, goats, rats, mtce. 
The animals are bled periodically and sera obtained are tested for the presence of 
antibodies to the injected antigen. The injections can be intramuscular, 
intraperitoneal, subcutaneous, and the like, and can be accompanied with adjuvant. 
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For the production of mOTOclonal antibodies. CO ICE protein or an 
anttgenic fragment, coupled to a suitable carrier, is injected into an appropnate non- 
huntan host anin,al as above for the production of polyclonal antibodtes In he case 
of monoclonal antibodtes. the animal ,s generally a mouse. ^"""'^'^^P^'"^'^ 
are then immortalized, often by fusion wtth a myeloma cell, as descnbed m Kohler & 
Milstein. 1975. Natu,^ 256:495-497. For a fuller description of the product.on of 
monoclonal antibodies, see > ntihodier M ahoratory Manual, Harlow & Lane. eds.. 
Cold Spring Harbor Laboratory Press, 1988. 

Gene therapy may be used to introduce CGICE polypeptides mto the 
cells of target organs, e.,. , the pigmented epithelium of the retina or other parts of the 
retina. Nucleotides encoding CGICE polypeptides can be ligated mto viral vec ors 
which mediate transfer of the nucleotides by infection of recipient cells. Suitable 
viral vectors include retrovirus, adenovirus, adeno-associated vims, herpes virus, 
vaccinia vims, and polio virus based vectors. Alternatively, nucleotides encoding 
CGICE polypeptides can be transferred into cells for gene therapy by non-viral 
techniques including receptor-mediated targeted transfer using ligand-nucleotide 
conjugates, l.pofection, membrane fusion, or direct microinjection. These procedures 
and variations thereof are suitable for ex vivo as well as in vivo gene therapy. Gene 
therapy with CGICE polypeptides will be particularly useful for the treatment of 
diseases where it is beneficial to elevate CGICE activity. 

The present invention includes DNA comprising nucleotides encoding 
mouse CGICE. Included within such DNA is the DNA sequence ^^own in Figure 
8AC(SEQ ID NO:28). Also included is DNA comprising positions 11-1,663 ot 
SEQ ID NO -28. Also included are mutant versions of DNA encoding mouse 
CGICE Included is DNA comprising nucleotides that are identical to positions 11- 
1 663 of SEQ. ID. NO.:28 except that at least one of the nucleotides at positions 26- 
28, positions 263-265, positions 287-289, positions 689-691, and/or PO-tions 905- 
907 differs from the corresponding nucleotide at positions 26-28, P— ^^^^^^^^^^^^ 
positions 287-289, positions 689-691, and/or positions 905-907 of SEQ. ID. N0..28. 
Particularly preferred versions of mutant DNAs are those in which the n-^oti^^ 
change results in a change in the corresponding encoded ammo acid The DNA 
encoding mouse CGICE can be in isolated form, can be substantially free from other 
nucleic acids, and/or can be recombinant DNA. 
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The present invention includes mouse CGICE protein (SEQ. ID. 
NO -29) This mouse CGICE protein can be m isolated form and/or can be 
sustantLy free from other protems. Mutant versions of mouse COiCB P„ 
also part of the present invention. Examples of such mutant mouse CGICE protems 
a:, proteins that are identical to SEQ. ID. NO.:29 except that the ammo ac.d a 
posLn 6, posit.on 85, posU,on 93, position 227, and/or posiuon 299 dtffe-^^^^^^^^ 
corresponding amino ac.d at position 6, pos,t,on 85, position 93, posmon 227, and/or 
position 299 m SEQ. ID. NO.:29. ^^^^ ^^^^ 

cDNA encoding mouse CGlCb can oe dmpmicu uy 
libraries made from mouse eye or mouse testis. SuUable primers can be readily 
designed based upon SEQ. ID. NO.:28. Alternatively, cDNA encodmg mouse 
CGICE can be isolated from cDNA libraries made from mouse eye or mouse test.s by 
the use of oligonucleotide probes based upon SEQ. ID. NO.:28. 

/„ situ hybridization studies demonstrated that mouse CGICE is 
specifically expressed in the retinal pigmented epithelium (see Figure 10). 

By providing DNA encoding mouse CGICE, the present mvention 
allows for the generation of an animal model of Best's macular dystrophy. Th.s 
anZl model L be generated by maUmg ".nocUout" or "Icnockm" m.ce contammg 
altered CGICE genes. Knockout mice can be generated in which portions of the 
mouse CGICE gene have been deleted. Knockin mice can be generated m which 
mutations that have been shown to lead to Best's macular dystrophy when p™ 
the human CGICE gene are introduced into the mouse gene. In particular^ mutations 
resulting in changes in amino acids 6, 85, 93, 227, or 299 of the mouse CGICE 
plin SEQ.ID NO.:29) are contemplated. Such knockout and knockin mice wi be 
valuable tools in the study of the Best's macular dystrophy disease process and will 
provide important model systems in which to test potential pharmaceuticals or 

treatments for Best's macular dystrophy. 

Methods of producing knockou. and knockin mice are well known ,n 
.he an. For example, .he use of gene-mrge.ed ES cells in .he generation of gene 
targeted .ransgentc knockout mice is described in, ..g.. Thomas e a 987 Cell 
51 503-512, and is reviewed elsewhere (Frohmanetal., 1989, Cell 56:145- 47 
Capecchi, .989.T,ends in Gene,, 5:70-76; Banbault et al., 1989, Mol. B,ol. Med. 



6:481-492). 



-24- 



0"3i&H29i&'+ .iEiEOO 



Techniques are available to inactivate or alter any genetic region to 
vtrtually any n,u,atio„ des,,^ by using targeted homologous — tj;™ -"/^^ 
specfic changes tnto chromosomal gene. Generally, use ,s made of a t^geung 
vector •• U.. a plasmid containing pan of the genettc tegion ,t ,s des.red to mutate. By 
le of the homology between this par. of the genefc re^on on the ^as-ntd a d .h^ 
corresponding genetic region on the chromosome, homologous recombtnahon can be 
us^: tnser, the plasmid into the genetic re^on, thus disrupting the genettc regton. 
Usually the targeting vector contains a selectable marker gene as well. 

in comparison with homologous extrachromosomal recombmatton, 
which occurs at frequencies approaching 100%. homologous P'-™"*-"™ 
^combination was originally repotted to only ^ detected at frequences between 10 
TdTo 3 (Lin et a,., 1985, Proc. Natl. Acad. Sci. USA 82:1391-1395., Sm.th.es e, al., 
1985 Natute 317: 230-234-, Thomas et aU 1986. Cell 44:419-428). 
Lhomologous plasmid-chromosome interactions are more frequ^^^^^ 
levels 105-fold (Lin et al., 1985, Proc. Natl. Acad. Sci. USA 82: 1391-1395 to 10 
fold (Thomas et al., 1986, Cell 44:419-428) greater than comparable homologous 

To overcome this low proportion of targeted recombination in murine 
ES cells, vanous strategies have l^n developed to detect or select «re homologous 
recombinants. One approach for detecting homologous alteration even s uses the 
polymerase chain reaction (PCR) to screen pools of transformant cells o 
LLogous inseriion, followed by screentng indiv.dual clones (Ktm et 1.. 1988, 
Nucleic Acids Res. 16:8887-8903; Ktm et al., 1991, Gene '03:227-233 ^ 
Alternatively, a positive genetic selection approach has been developed ,n whtch 
rrgene -s Istructed wh.ch w„l only be -ve .f homologou^— occur,, 
allowing these recombinants to be selected ditectly (Sedivy et al., 1989, P«>c. Natl 
tad. sl USA 86:227-23 1). one of the most ^werful apptoache^^^^ 

selecting homologous recombinants is the positive-negative selection (PNS) method 
dtli. for genes for which no dnect selection of the a"~is. (~ 
, al 1988 Nature 336:348-352; Capecchi, 1989, Science 244:1288-1292, Capecchi. 
989 Tr nds in Genet. 5:70-76). The PNS method is more efficient for targeting 
n : which a„ not e.ptessed at htgh levels because the marlcer gene has its w^ 
Lmoter. Nonhomologous tecombinants are selected agatnst by us.ng the Herpes 
Simplex virus thymidtne kinase (HSV-TK) gene and selecting agatnst its 
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„o„ho,no,ogous insertion w„h herpes drugs such as ganeydovir (GANG) or FIAU (1- 
2 2 nuo™-B.D-arabinoflura„osyl)-5-icdoumci,). By .his cou„ter-selec«o.. 

IhetrcUe of homOogous — an<s in ihe surviving ,ransfom,an.s can be 

increased. 

The follcv-ing non-linriring examples are presen.ed to berter iliustrare 

the invention. 

EXAMPLE 1 

idemificaimofthei^^ 

Constiuclion of Libraries for Shotgun Sequencing „,, ,. , 

Bacteria, strains containing the BMD PACs (PI Artrfic, 1 
Chromosomes) were received from Research Genetics (Huntsvilie, AL). The 
m!lum tiling path between markers DUS4076 and UGB that tepresents the 

gene" regio„ containtng the BMD gene incudes the fo iowrng nme PAC 
clones- 363M5 (140 kb), 519013(120 kb), 527E4 (150 kb), 688P12 (140 kbX 
74,N15 (170 kb 756B9 (120 kb), 759J12 (140 kb). 1079D9 (170 kb). and 363P2 
Zb) ™1 'tere st Jked on Luna-Bertani (LB) agar plates supplemente w,th 
fapptpriate antibiotic. A single colony was prcked up and subieced to c^ony- 
. PGR Llysis with corresponding STS primers descnbed ,n Cooper « .1 1997 
Ge" 185-192 to confirm the authentrcity of PAC clones. A s.ngle pos.trve 
cZrwas sed to prepare a S-ml starter culture and then 1-L overnight culture ,n 
colony was centrifugation and PAC DNA was punfred by 

n a AERO-Lt nebulizer (CIS-US. Bedford, MA). The nebulizer w. attach to a 
gas source and the OKA was randomly ^"-^ .^^P;"' 
,0 shea^d DNA was ethanol precrpitated and resusp^nded m TE ('» Tns mM 
EDTA) The ends were made blunt by treatment w,th Mung Bean Nuclease 
(Z ga, Madrson, Wl) at 30"C for 30 min, followed by phenol/ehloroform 
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extraction, and treatment with T4 DNA polymerase (GIBCO/BRL. Gaithersburg, 
MD) in multicore buffer (Promega, Madison, WD in the presence of 40 uM dNTPs at 
16°C To facilitate subcloning of the DNA fragments, BstX I adapters (Invitrogen. 
Carlsbad, CA) were ligated to the fragments at 14°C overnight with T4 DNA Hgase 
(Promega, Madison, WI). Adapters and DNA fragments less than 500 bp were 
removed by column chromatography using a cDNA sizing column (GIBCO/BRL. 
Gaithersburg, MD) according to the instructions provided by the manufacturer. 
Fractions containing DNA greater than 1 kb were pooled and concentrated by ethanol 
precipitation. The DNA fragments containing BstX I adapters were ligated mto the 
BstX I sites of pSHOT H which was constructed by subcloning the BstX I sites from 
pcDNA n (Invitrogen, Carlsbad, CA) into the BssH n sites of pBlueScript 
(Stratagene, La Jolla, CA). pSHOT n was prepared by digestion with BstX I 
restriction endonuclease and purified by agarose gel electrophoresis. The gel punfied 
vector DNA was extracted from the agarose by following the Prep-A-Gene (B.oRad, 
Richmond, CA) protocol. To reduce Hgation of the vector to itself, the digested 
vector was treated with calf intestinal phosphatase (GIBCO/BRL, Gaithersburg, MD. 
Ligation reactions of the DNA fragments with the cloning vector were transformed 
into ultra-competent XL-2 Blue cells (Stratagene, La Jolla, CA), and plated on LB 
agar plates supplemented with 100 ng/ml ampicillin. Individual colonies were picked 
into a 96 well plate containing 100 ^l/well of LB broth supplemented with ampicllm 
and grown overnight at 37°C. Approximately 25 nl of 80% sterile glycerol was added 
to each well and the cultures stored at -80°C. 

Preparation of plasmid DNA 

Glycerol stocks were used to inoculate 5 ml of LB broth supplemented 
with 100 jxg/ml ampicillin either manually or by using a Tecan Genesis RSP 150 
robot (Tecan AG, Hombrechtikon, Switzeriand) programmed to inoculate 96 tubes 
containing 5 ml broth from the 96 wells. The cultures were grown overnight at 37°C 
with shaking to provide aeration. Bacterial cells were pelleted by centrifugation , the 
supernatant decanted, and the cell pellet stored at -20°C. Plasmid DNA was prepared 
with a QIAGEN Bio Robot 9600 (QIAGEN, Chatsworth, CA) according to the 
Qiawell Ultra protocol. To test the frequency and size of inserts, plasmid DNA was 
digested with the restriction endonuclease Pvu H. The size of the restriction 



-27- 



20I77YP 



endonuclease products was examined by agarose gel electrophoresis with the average 
insert size being 1 to 2 kb. 

DNA Sequence Analysis of Shotgun clones 
5 DNA sequence analysis was performed using the ABI PRISM™ dye 

terminator cycle sequencing ready reaction lot with AmpliTaq DNA Poly"-;"^;; 

(Perkin Elmer, Norwalk, CT). DNA sequence analysis was performed with M13 

forward and reverse primers. Following amplification in a Perkin-Elmer 9600, the 

extension products were purified and analyzed on an ABI PRISM 377 automated 
0 sequencer (Perkin Elmer, Norwalk, CT). Approximately 4 sequencing reactions were 

performed per kb of DNA to be examined (384 sequencing reactions per each of nine 

PACs). 

Assembly of DNA sequences 

5 Phred/Phrap was used for DNA sequences assembly. This program 

was developed by Dr. Phil Green and licensed from the University of Washington 
(Seattle WA) Phred/Phrap consists of the following programs: Phred for base- 
calling Phrap for sequence assembly, Crossmatch for sequence comparisons, Consed 
and Phrapview for visualization of data. Repeatmasker for screening repetitive 

20 sequences. Vector and E. coli DNA sequences were identified by Crossmatch and 

removed from the DNA sequence assembly process. DNA sequence assembly was on 
a SUN Enterprise 4000 server running a Solaris 2.51 operating system (Sun 
Microsystems Inc., Mountain View, CA) using default Phrap parameters. The 
sequence assemblies were further analyzed using Consed and Phrapview. 

Identification of new microsatellite genetic markers from the Best's macular 
dystrophy region 

Isolation ofCA microsatellites from PAC-specific sublibraries, 
Southem blotting and hybridization of PAC DNA with a (dC-dA)„ (dG-dT) probe 
30 (Pharmacia Biotech. Uppsala, Sweden) was used to confirm the presence of CA 
repeats in nine PAC clones that represent a minimum tiling path. Shotgun PAC- 
specific sublibraries were constmcted from DNA of all 9 PAC clones using a protocol 
described above. The sublibraries were plated on agar plates, and colonies were 
transfei^d to nylon membranes and probed with randomly primed polynucleotide, 
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(dC-dA)n (dG-dT)n, Hybridization was performed overnight in a solution containing 
6X SSC 20 mM sodium phosphate buffer (pH 7.0), 1% bovine serum albumin, and 
0 2% sodium dodecyl sulfate at Filters were washed four times for 15 mm each 

in 2X SSC and 0.2% SDS at 65°C. CA-positive subclones were identified for all but 

5 one PAC clone (527E4). DNA from these subclones was isolated and sequenced as 
descrobed above for the shotgun library clones. 

Identification of simple repeat sequences in assembled DNA 
sequences. DNA sequence at the final stage of assembly was checked for the 
presence of microsatellite repeats using a Consed visualization tool of the 

10 Phred/Phrap package. 

Polymorphism analysis and recombination mapping 

Sequence fragments containing CA repeats were analyzed using the 
PRIMER program; oligonucleotide pairs flanking each of the CA repeats were 
15 synthesized. The forward primer was kinase-labeled with [gamma-32P]-ATP. 
Amplification of the genomic DNA was peformed in a total volurne of 10 m1 
containing 5 ng/^l of genomic DNA; 10 mM Tris-HCl pH 8.3; 1.5 mM MgCl2 , 0 
mMKCl- 0 01% gelatin; 200 mM dNTPs; 0.2 pmol/Ml of both primers; 0.025 un.t/Ml 
of Taq polymerase. The PCR program consisted of 94° C for 3 min followed by 30 
20 cycles of 94°C for 1 min, 55oC for 2 min. 72°C for 2 min and a final elongation step 
at 72°C for 10 min. Following amplification, samples were mixed with 2 vol of a 
formamide dye solution and run on a 6% polyacrylamide sequencing gel. Two newly 
identified markers detected two recombination events in disease chromosomes of 
individuals from family Si. This limited the minimum genetic region to the interval 
25 covered by 6 PAC clones: 519013, 759J12, 756B9, 363M5, 363P2, and 741N15. 

Identification of the retina-specific EST hit in the pCA759112-2 clone. 

A CA-positive subclone (pCA759J12-2) was identified in the shotgun 
library generated from the PAC 759j 12 DNA by hybridization to the (dC-dA)n (dG- 
30 dT)n probe. DNA sequence from pCA759J12-2 was queried against the EST 

sequences in the GenBank database using the BLAST algorithm (S.F. Altschul. et al., 
1990 J Mol. Biol. 215:403-410). The BLAST analysis identified a high degree of 
similarity between the DNA sequence obtained from the clone PCAJ59J12-2 and a 
retina-specific human EST with GenBank accession number AA318352. BLAbiX 
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• , ^- '>';hiohlv conserved aromatic residues, 
amino acid sequence including 25 highly conserv 

A human gene partially represented in pCA759J12 2 and 
AA318352 was dubbed CGICE (Candidate Gene #1 with the homology to the C. 
elegans group of genes) and selected for detaled analysis. 

Biolnformatic Analysis of Assembled DNA Sequences ^^^^p^^s 

When the assembled DNA sequences from the mne BMD PACs 

hed 0 5 1 fold coverage, the DNA contigs were randomly concatenated, and 
approached O.S-l-fold coverag , 

prediction abilities of the program package AceDB were 

In addition lo i ^^^p^ (GeneBank 

mentioned above, Genbank ^-^'^ ^l" "^^^^ were analyzed with the 

accession numbers AC003025 ^^^^^^^^^^^ „t pans of 

use of the same AceDB package. PAC clones 4C0 Genomics 41:185-192); 

.He PAC contig across the BMD region (Cooper ^^^^^^^^ ,,,, 
both clones map to the minimum genetic ^^^^^^ —'J^^^^^ ^^edish pedigree 
determined by recombination breakpoint analysis in a 12-generation F 
determine y r- „«t ini • 763-279) Datbase entnes for PACs 466 All 

(Graff., a, .. 1997. ""J^^^'jj^La^d in Phase 1 High Throughput 

363P2 -P--"' °^;,,7:;%,„on,e Scence and Technology 

Genome Sequence Project (HTGS phase oy 
2nter,UniversityofTexasSouthwestemMedical center at Dallas. 

cDNA sequence and exon/intron organization of the CGICE gene 

Genomic DNA sequences from PACs 466A1 1 and 759J12 were 
. .TTc ICE cDNA sequence from EST AA318352 using the program 

zrc:!--^^^^ 

" identification of intron/exon boundanes was then — ^^^^^^ 
Zuallycompanngvisuahzed genomic andcDNAseque.^^^ 
package. This analysis allowed the identification of exons 8, 9, and 
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, in FST AA318352. To increase the accuracy of the analysis, the DNA 
of KT Ia3.8352 »as veHfied by comparison w,th genomic sequence 

T . f ir„CA75m2 2 PAC 466A. 1, and shotgun PAC 759J.2 subclones, 
obtained from PCA759 12 2. ^^^^^ ^^^.^ 

The venfied EST ^'^3'««2 seque ^^^^^^ 
Cacce— 

^^e^n e1, no™* CT), and a consensus s^uence denved^om tht. ^Ts 
AA3 18352, AA3071 19, and AA205892) was ^-analyzed by BLAST. BLAST 
tal ,! ,de;,ir,ed a founh EST belonging to th.s clus.e. (acc^^n num 
AA317489, EST AA317489 was •-'"^^'j" ^^^^^^ 
consensus sequence derived from the four ESTs (AA3 18352, AA307 " • 
and II" 17489) was compared with genomic sequences obtained from PCA-'59 12-2, 
andAA3n48y)w ^^tclones using the programs Crossmatch 

Td'A^B' t"::" ve'Jdt'quenceandcorrecte^ 
:etrn:,n"A31S3^52,AA307119,AA205S92 a dAASm^^^^^^^ 

CDNA and genomic sequences revealed a tota o ^^-^^^'^^^^^^^ analysis 

r,:>Ti:=^^ 

tamiiy;. cmi^ r-r\rv apne was called exH 

stretch of genomic sequence, this part of the CG ICE gene was c 

,^thetL,e.7.~^^^ 

e:ride„tif.eda„add,t,— 

i a high degree of similarity to genomic sequence. Comparison ot B 

1.H the nresence of two additional exons(exl and ex2)m the CGlCt, 
sequences revealed the presence or iwo 5'-ex l-ex2- 

;:e::rri:^e:;erne,ons2and4,e,ons«and7,a„de.ons^ 

preoicuoi u^.^pen ESTs represented in exons 1 and 2 from one 

0 addUion there was no overl p etwe^^^^^^^^^^ ,,ere was the possihUUy of 

side and exons 4, 5, 6, 7, 8, 9, 10 ana represented in the 

the presence of additional exons in the CGICE gene that we 



GenBank EST database. 



-31- 



Identification of an additional exon and determmation of the exact exon/mtron 
boundaries within the CGICE gene. 

To identify additional exon(s) within the CGICE gene and venfy the 
e.on,c conipositton ofthis gene, forward and reverse ^"-'^j" ^ 
exons of the CGICE gene were synthesized and used to PCR ^7'''"^° f^^f ^„ 
fragments from human retina "Marathon-ready" cDNA (Clontech, Palo Alto. CA). h, 
these RT-PCR experiments forward primer from exl (LF: 

-:rm?:rG^^==o^^^^^ 

:Z from e. <G. O-OC-O A^^^^^^^^ , 

:;^;"rd;r^^^^^^ 

) (SEQ.ID.no.: 13) was paired with a reverse primer from ex8 (AR. 
TGTCGACAATCCAGTTGGTCT) (SEQ.ID.NO.: 14), forward pnmer from ex8 (AF. 

paired with a reverse pnmer from ex 11 (DR. GC A 1 CUi^L,/^ i 
(SEQ.ID.no.: 18). 

A 50 ^1 PGR reaction was performed using the Taq Gold DNA 
polymerase (Perktn Elmer, Norwalk. CT) in the reaction buffer supplied by the 
manufacturer with the addition of dNTPs, pnmers, and approximately 0.5 ng of 
human retina cDNA. PGR products were electrophoresed on a 2% agarose ge d 
DNA bands were excsed, punfied and subjected to sequence analys.s wUh the same 
primers that were used for PCR amplification. The assembly of the DNA sequence 
results of these PCR products revealed that: ^ = ^ 7 « g lO and 

(i) exons 1 and 2 from one side and exons 4, 5, 6, 7, 8, y, lU, ana 
11 indeed represent fragments of the same gene ^ . 

) (ii) an additional exon is present between exons 2 and 4 (named 

'"'^ (iii) exon 7 (Hypothetical) predicted by the BLASTX analysis is 

present in the CGICE cDNA fragment amplified by EF/AR primers. 
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comparison of ,he DNA sequences ob,a,„ed from RT-PCR tagmems 
. ^at;qti-7 9 PAC466All,andshotgun 
with genomic sequences obtained from pCA759J ^^-^-^^^^n,.^^ and AceDB. 
P.c™n.su.,on.^as.rfo™^^^^^^^^^^^ 

« aTo^^. A-n4S9, and AA3.6727) and >den„fled an 
irnaLZon3>,n.HeCO.CBge„e...ac.se.^^^^^^^^ 
boundaries within the COICE gene wete detennmed for a of the ex^ns. P 

Signals .n a,, introns confom. to publish ^~;^;^^J:J^ Zn 1 e.ons. 
appears to span at least 16 kb of genomrc sequence. It contams 

.„osplicedonor.tes~^^ 

bp. Both splicing sites confom. to the pubHshed consensus sequence. 

Identification of 5' and 3' ends of CGICE cDNA ^^.^ 

RACE is an established protocol for the analysis ot cUIN a 
procedure was performed using the Marathon RACE template from human retma, 
purchased from Clontech (Palo Alto, CA). cDNA pnmers KR 
^rr A AGCGGGCATTAGCCACT) (SEQ.ID.NO.: 19) and 

R^TGgS?^^^^^^^ (SEQ.ID.NO.:20) m combination with a 

LR(TGGGGTrCCAOG q^cTCACTATAGGGC ) 

cDNA adaptor pnmer API (CCA l^^ii^^ 

(SEQ.m.NO.:21, were in combinatron 

5 (OOATOAAC^ACATrCCrAACCTO™ , 

„,th a CDNA adaptor P^^^^^'^^^^^^^^,, ,,om these PCR 

r^FO TD NO -21) was used in 5 KA^-i:-. nuuu 

[^t.u.ixj,i^yj -^^j FvH^ed fraements from the gels 

wit^ The oroducts were analyzed on ABl 3 / / sequen^-c 
\0 terminator sequencing kits, ineproauci 

according to standard protocols. 
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EXAMPLE 2 

reg ion CGICE 

^ • nNA from BMD patients from two Swedish pedigrees 
Genomic DN A fron. BMi^ P ^^^^^ ^^,„g 

havingBest'smaculardystrophy (famihesSl andSL76) 

thefollowingpnmerP^n^^^^ 

exGjeft ^^^^^^ J^l^^r^TT^r fSEO ID NO.:24) 

exG_nght CTCCACCCATCTTCCGTO (^^^^^^^ ^^^^^^^^ 

This primer pair amplifies a genomic fragment that ,s 41 P 

and adjacent intronic regions. 

The patients were: 

FamilySl: „nt having BMD; sister of S 1-4 

S 1-3 a normal individual, i.e., not havmg bi 
Si-4 an individual heterozygous for BMD; and 

Intgls Observable upon opb,h.i™,ogic examination. 

"aJ«viaua,He.e™.y.ousfo,BMP;n,o.bero^^ 

SU6-2. an individual betemzygous for ^ ^„„„„,„ ,bove were 

PCR products produced usmg .he pnmer se 
an,plir,ed in 50 .1 -Uons consisUng of P^^n-Elm^ « ^^^^^^^^ ^ 

^P.s. 0.5 u, ofTacGold <^"«"-^'"";;j;'c^i„,Ld,Uons we„ as 
DN A and 0.2 (.M of forward and reverse pnmers. Cycl g 

follows: 

94°C 10 min 

94°C 30 sec i°rDer cycle) 

720C 2 min (decrease this temperature by 1.1 C per eye 

72°C 2 min 

Go to step 2 15 more times 

94°C 30 sec 
55°C 2 min 



25 



30 



1. 

2. 
3. 
4. 

5. 

6. 

7. 
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g 72°C 2 min 

9 Go to step 6 24 more times 

10. 72°C 7 min 

■ ^f.n. this PGR amplification were analyzed on 2% 
products obtained ^ o^^^^ J^^ , ,3ing Qiagen 
agarose gels and excised fragments .ylrminator sequencing kits. 

QIAquickspin columns and sequenc^^^^^^^^^^^^^ 

The products were analyzed on ABl 3 4 ^ , ^hromatogram 

The results are shown m Figure 5. Mg ^ ^^^^ 

.h^ PCR fragments from patients S 1-3, i ^. an 
from sequencing runs on the PGR g ^^^^^^^^^ ,he 

six readings represent -^"^^f^^^^^^^^^^^ ^..ents affected with BMD. patients 

patients. As can be seen from Figure 6, tw p g^.h copies 

Sl-4 and Sl-5, both carry a mutation at position 83 o^^^^^^^^ ^^^^^^^^^^^^ 
of the CGICE gene are mutated in — ^^^^^^^^f gene. This 

affected Sl-4 contains both norma, and ^^^^^^^^^ ,3 of 

mutation Changes the codon .eloding cysteine). Patient 

SE0ID.NO.:3 from TOG (encoding tryptophan) to K ^^i,,„don. This 

SEQ.ro.NO.:2. This — Z'"''"^" '7j;„,,„^ ,y„„„e) >o CAC (encoding 
position 85 of SEQ.ID.N0.:3 from TAG F AC codon «as 

. LsUdine). THUd,sease.o,aUon.a^^^^^^^^ 
not found upon sequencing of 50 normal un 

North American descent. ^^^^^ p^^^^i„ 

Amino acid positions 85 and o occupied by 

1 conserved Figure 7 demonstrates that position 93 is occupi ^^^j^^ 

proteins. The lone C. "^^^J^t^yll^^^^^ and tryptophan, both being 
an isofunctional phenylalanine instead. Ph y ^^^^.^^ 
hydrophobic, aromatic amino acids, are highly similar. 



30 
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.^os,ne and — nal p».ne in aU 16 ^^^^^^^^ 
Phenylalanine and tyrosine, both bemg aromatic amino acids, are hig y 



EXAMPLE 3 



5 F.x pressio" CGICE 

RT PCR RT-PCR experiments we.e perfonned on "quick^lone" 
..ancON^Ip-es — .„mC,o„.ecHP.o^^^^^^^^^^^ 
..„,..,„,p.ce„.a,,u^,We.s^^^^^^^ 

conditions: 

J 94°C 10 min 

04*^CI 30 sec 

3 72°C 2 min (decrease this temperature by 1.1°C per cycle) 

j5 4 72°C 2 min 

5' Go to step 2 15 more times 

g 94°C 30 sec 

7 55°C 2 min 

g 72°C 2 min 

20 9. Go to step 6 19 more times 

10. 72°C 7 min 

THe CO.CE gl^as found .o be p^dominanUy exposed in .uman reUna and brain 
Nor,ke^Uo,.nalys:s: Northern b.o,s —g poWA.)-RNA 

" .on, dirreren. Human tissue, wete purchased from '^^rl^I'^Cnd 
contained human hear,, brain placenta, lung Uver, J 
pancteas poWA.)-RN A. Blot #2 cont^ned ^^"r^^-^^ro^' ^^'"'^ 

amplify a PCR product from total genomic DNA. This product P 
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boston, jvm; p;^catawav NJ). The radiolabeled probe at a 

spin column (Pharmacia/Biotech, Piscataway,JNj; hnffer fClontech 

ft., ^han 1 X 106 com/ml in rapid hybridization butter (.cioniecn, 
concentration of greater than 1 x 10 cpn^ P 
Palo Alto, CA) was incubated overnight at 65 C. The blots 
min incubations in 2X SSC, 0.1% SDS (prepared from 20X SSC and 20 7 

, Pi.her Pittsburgh PA) at room temperature, followed by two 15 mm 

soluuons, P-"^^^^^^^^^^^ ,,,p,rature, and two 30 min incubations 

Toirs^^^ 

the bands that specifically hybridized to the radiolabeled probe. 

L probe hybridized to an mRNA transcript that is umquely 

expressed in brain and spinal cord. mrFpenewas 
Mouse probe for the murine ortholog of the GCICE gene was 

4^ TiTQT with GenBank accession number 
generated based on the sequence of an EST with GenBan 

AA497726. The 246 bp probe was amplified from mouse heart cDNA (Clontech, 
Palo Alto, CA) using the primers mouseCGlCE_L rr^rv R 

(ACACAACACATTCTGGGTGC) (SEQ.ID.NO.:26) and mouseCGlCE R 
^^crGAAACTGCTTCC^^^^ (SEQ.ID.NO.:27). Due to an extremely low 
Z^TnTZ of the CGICE gene in mouse heart, repetitive amplification steps 

werl sed to generate this probe. The authenticity of this probe was verified by _ 
wereuseaiogc . . Northern blot containing poly(A+)- 

sequence analysis of the gel punfiedDNA band. Nort ^^^^ ^^^^j, ^i^ney, 

RNA from several rat tissues (heart, brain, spleen, lung, liver, skeletal m 
RNA trom sevci hybridized to an 

. testis) was purchase from Clontech, Palo Alto, CA. ine pr 
.^PN A transcriot that is expressed in testis only. 

„RNA »„scnp^^ l^^^,,„ .„ ^ ,i™.ed scope by .he specfic 

e^bodimen. descnbed he.in. Indeed, various ™odif,ca,ions of — 
addi„on .0 .hose descnhed herein w,U become apparen, .o .hose s^ e m .he ^ 
0 from .he foregoing descrip.,on. Such .odif,ca.ions are ,n.e„ded .o fall 

^rsXuons are ci,ed here,„, .he discosures of wh,ch are 
incorporated by reference in their entireties. 
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